Inferring Semantic Information from Natural-Language Software Artifacts
نویسنده
چکیده
Code-level specifications play an important role in software engineering. In addition to guiding the development process by outlining what/how to reuse, specifications also help in verification process by allowing quality assurance practitioners to test the expected outcome. One of the valuable source of such specifications are the Natural language API documents. However, sometimes humans often overlook these documents and build software systems that are inconsistent with specifications described in those documents. While there are tools and frameworks available to assist humans to build/reuse quality software, these tools are not designed to work on specifications in natural language. To address this issue, this report presents a Natural Language Processing (NLP) framework to automate the task of inferring semantic information from natural language software artifacts to bridge the disconnect between the inputs required by software engineering tools/frameworks and the specifications described in natural language. This report is a part of a larger dissertation goal to improve developer / tester / end-user productivity by automatically inferring semantic information from the textual descriptions in software artifacts. Specifically, in this report I present two recent research efforts that I have conducted in developing/applying NLP techniques for inferring semantics from natural language software artifacts. Furthermore, I also outline my dissertation plan along with intermediate deliverables to achieve my goal.
منابع مشابه
Pandita, Rahul. Inferring Semantic Information from Natural-language Software Artifacts. (under the Direction of Laurie Williams.) Inferring Semantic Information from Natural-language Software Artifacts
PANDITA, RAHUL. Inferring Semantic Information fromNatural-Language Software Artifacts. (Under the direction of Laurie Williams.) Specifications play and important role in software engineering for ensuring software quality. Not only do the specifications guide the development process by outlining what/how to reuse, they also help in the verification process by allowing testers to test the expec...
متن کاملText mining and software engineering: an integrated source code and document analysis approach
Documents written in natural languages constitute a major part of the artifacts produced during the software engineering lifecycle. Especially during software maintenance or reverse engineering, semantic information conveyed in these documents can provide important knowledge for the software engineer. In this paper, we present a text mining system capable of populating a software ontology with ...
متن کاملAutomatic indexing of software artifacts
This paper describes the classification mechanism of ROSA, a software reuse system based on the processing of the natural language descriptions of software artifacts. The system supports the automatic indexing of components by acquiring lexical, syntactic and semantic information from software descriptions. Major goals of the approach are cost-effectiveness and retrieval precision. Some results...
متن کاملUsing english to retrieve software
This paper describes ROSA, a software reuse system based on the processing of the natural language descriptions of software artifacts. Lexical, syntactic and semantic analysis of software descriptions is performed to automatically extract both verbal and nominal phrases from descriptions and use this information to create frame-based indexing units for software components. Retrieval similarity ...
متن کاملA similarity measure for retrieving software artifacts
presents the mechanism for query processing and retrieval with the measures used for the similarity analysis of the indexing structures. Section 6 describes an experiment conducted to evaluate the effectiveness of the proposed approach. Section 7 summarizes related work in the area of re-use systems. Section 8 concludes the paper with some remarks on planned experiments with the system and furt...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013